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HPV31 LI total rebuild nucleotide and mint, acid sequence* 


601 


M V 0 T GFG AMD FTAL Q D T 
ATGSTCGACA CCGGTTTCGG TGCTATSGAC TTCACCGCTT TSCAAGACAC 


1 


MSIK RPS EAT V Y I P P V P 
ATGItTTTST GGAGACCATC TGAAGCTACC GTUACTTSC CACCAGTCCC 


651 


KSN V P L D ICN SIC KYP0 
CAAGTCTAAC GTCCCATTGG ACATCTGTAA CTCTATCTGT AAGTACCCAG 


51 


VSK V V S T 0 E Y VTR T N I Y 
AGTCTCTAAS GTCGTCTCTA CCGACGAATA CGTCACCAGA ACCAACATtT 


701 


YLK M V A EPYG DTI FFY 
ACTACTTGAA GATGGTCQCT GAACCATACG GC6ACACCTT GTTCTTCTAC 


101 


YHA GSA RLLT V6H P Y Y 
ACTACCACGC TGGTTCTGCT AGATTGTTGA CCGTCGGTCA CCCATACTAC 


751 


L R ft £ 0HF VRN FFNR SGT 
TTGC6TAGA6 AACAGATGTT CGTAAGSCAC TTCTTCAACA GATCCGQCAC 


151 


SIPX SDN PICK IVVP KVS 
TCTATCCCM ASTCTSACAA CCCAAA6AAG ATCGTCGTCC CAAAGGTCTC 


801 


V 6 E SVPT DLY IKG S6ST 
CSTAGGTGAA TCTGTCCCAA CCGACCTSTA CATCAASGGC KCGSTTCCA 


201 


GL0 Y R V F R V P. L P D PNKF 
TGGTTTGCAA TACAGAGTCT TCAGAGTCA6 ATTGCCAGAC CCAAACAA6T 


851 


ATL A N S TYFP TPS GSM 
CCGCTACCO GGCTAACTCC ACCTACTTCC CAACTCCATC TGGCTCCATG 


251 


GFP 0TS FYNP E T Q RLV 
TCGGTTTCCC AGACACCTCT TTCTACAACC CAGAAACCCA AAGATTSGTC 


901 


VTSD AOI FKK PYWM Q R A 
GTCACCTCCS ACGCTCA6AT CTTCAACAAG CCATACTGGA TGCAGCGTGC 


301 


WACV GLE V 6 R G Q P L GVG 
TGG6CTTGTG TCGGTTTGGA AGTCGGTAGA GSTCAACCAT TSSSTGTCGG 


951 


Q6H NN6I C W 6 NQL F V T V 
ACAG6GTCAC AACAACGGTA TCTGTTGGGG TAACCAGCTG TTCGTGACTG 


351 


ISG HPLL MKF DOT ENSN 
TATCTCTGGT OCCCAnGT T6AAGAAGTT CGACGACACC GAAAACTCTA 


1001 


VDT TRS TNHS VCA A I A 
T6GTC6ATAC CACGCGTTCT ACCAACATGT CTGTCTSTGC TGCAATCGCT 


401 


RYA GGP GTDN RFC ISM 
ACAGATACGC TGGTGGTCCA GGTACCGACA ACAGAGMTG TATCTCTATG 


10S1 


WSDT TFK SSR FKEY LRH 
AACTCTGACA CTACCTTCAA 6TCCTCTAAC TTCAAGGA6T ACCTGAGACA 


451 


OYKQ T Q L CLL 6 C K P PIG 
GACTACAAGC AAACCCAATT falbHIUIb GSTTGTAAGC CACCAATCGG 


1101 


GEE F 0 L Q F1F QIC ICITL 
TG5TGAGGAA TTCGATCTGC AATTCATCTT CCAGTT6TGC AAGATCACCC 


501 


EHH GRGS PCS N * A ITPG 
TGAACACTG6 GGTAAGGGTT CTCCATGTTC TAACAACGCT ATCACCCCAG 


1151 


SAD INT YIHS MHP AIL 
TGTCTGCTGA CATCATGACC TACATCCACA GTATGAACCC TECCATCCTG 


551 


0CP PIELKHSVI0 0GO 
GTSACTGTCC ACCATTG6AA TTGAAGAACT CTGTCATCCA AGACGGTGAC 


1201 


EDWN FGl TTP PSGS LEO 
6AGGACTGGA ACTTCGGTCT GACCACTCCA CCTTCCGGT7 CTTTGGAAGA 
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^ (57) Abstract: Synthetic DNA molecules encoding the HPV31 LI protein are provided. Specifically, the present invention provides 
O polynucleotides encoding HPV31 LI protein, wherein said polynucleotides are free from internal transcription termination signals 
O that are recognized by yeast. Also provided are synthetic polynucleotides encoding HPV31 LI wherein the polynucleotides have 
^ been codon-optimized for high level expression in a yeast cell. The synthetic molecules may be used to produce HPV31 virus- 
Q like particles (VLPs), and to produce vaccines and pharmaceutical compositions comprising the HPV31 VLPs. The vaccines of 
^ the present invention provide effective immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell- 
J^- mediated immunity. 
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For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 
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TITLE OF THE INVENTION 

OPTIMIZED EXPRESSION OF HPV 3 1 LI IN YEAST 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application claims the benefit of U.S. Provisional Application No. 60/457,172 filed 

March 24, 2003, the contents of which are incorporated herein by reference in their entirety. 

FIELD OF THE INVENTION 

The present invention relates generally to the therapy of human papillomavirus (HPV). 
10 More specifically, the present invention relates to synthetic polynucleotides encoding HPV3 1 LI protein, 
and to recombinant vectors and hosts comprising said polynucleotides. This invention also relates to 
HPV31 virus-like particles (VLPs) and to their use in vaccines and pharmaceutical compositions for 
preventing and treating HPV. 

15 BACKGROUND OF THE INVENTION 

There are more than 80 types of human papillomavirus (HPV), many of which have been 
associated with a wide variety of biological phenotypes, from benign proliferative warts to malignant 
carcinomas (for review, see McMurray et al, Int. J. Exp. Pathol 82(1): 15-33 (2001)). HPV6 and 
HPV1 1 are the types most commonly associated with benign warts, nonmalignant condylomata 

20 acuminate and/or low-grade dysplasia of the genital or respiratory mucosa. HPV1 6 and HPV1 8 are the 
high-risk types most frequently associated with in situ and invasive carcinomas of the cervix, vagina, 
vulva and anal canal. More than 90% of cervical carcinomas are associated with infections of HPV16, 
HPV 18 or the less prevalent oncogenic types HPV31, -33, -45, -52 and -58 (Schiffinan et al., J. Natl 
Cancer Inst. 85(12): 958-64 (1993)). The observation that HPV DNA is detected in 90-100% of cervical 

25 cancers provides strong epidemiological evidence that HPVs cause cervical carcinoma (see Bosch et al., 
J. Clin. Pathol 55: 244-265 (2002)). 

Papillomaviruses are small (50-60 nm), nonenveloped, icosahedral DNA viruses that 
encode up to eight early and two late genes. The open reading frames (ORFs) of the viral genomes are 
designated El to E7, and LI and L2, where "E M denotes early and "L" denotes late. LI and L2 code for 

30 virus capsid proteins, while the E genes are associated with functions such as viral replication and cellular 
transformation. 

The LI protein is the major capsid protein and has a molecular weight of 55-60 kDa. The 
L2 protein is a minor capsid protein. Immunological data suggest that most of the L2 protein is internal 
to the LI protein. Both the LI and L2 proteins are highly conserved among different papillomaviruses. 

-1- 
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Expression of the LI protein or a combination of the LI and L2 proteins in yeast, insect 
cells, mammalian cells or bacteria leads to self-assembly of virus-like particles (VLPs) (for review, see 
Schiller and Roden, in Papillomavirus Reviews: Current Research on Papillomaviruses; Lacey, ed. 
Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). VLPs are morphologically similar to 
5 authentic virions and are capable of inducing high titers of neutralizing antibodies upon administration 
into an animal or a human. Because VLPs do not contain the potentially oncogenic viral genome, they 
present a safe alternative to use of live virus in HPV vaccine development (for review, see Schiller and 
Hidesheim, 7. Clin. Virol. 19: 67-74 (2000)). For this reason, the LI and L2 genes have been identified as 
immunological targets for the development of prophylactic and therapeutic vaccines for HPV infection 
10 and disease. 

HPV vaccine development and commercialization have been hindered by difficulties * 
associated with obtaining high expression levels of capsid proteins in successfully transformed host 
organisms, limiting the production of purified protein. Therefore, despite the identification of wild-type 
nucleotide sequences encoding HPV LI proteins such as HPV31 LI proteins (Goldsborough et al., 
15 Virology 171(1): 306-31 1 (1989), it would be highly desirable to develop a readily renewable source of 
crude HPV proteins that utilizes HPV3 1 LI -encoding nucleotide sequences that are optimized for 
expression in the intended host cell Additionally, it would be useful to produce large quantities of 
HPV31 LI VLPs having the immunity-conferring properties of the native proteins for use in vaccine 
development. 

20 

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity 
to the protein products expressed by HPV31 LI genes, which have been associated with cervical cancer. 
Specifically, the present invention provides polynucleotides encoding HPV31 LI protein, wherein said 

25 polynucleotides are free from internal transcription termination signals that are recognized by yeast. Also 
provided are synthetic polynucleotides encoding HPV31 LI wherein the polynucleotides have been 
codon-optimized for high level expression in a yeast cell. The present invention further provides HPV3 1 
virus-like particles (VLPs) and discloses use of said VLPs in immunogenic compositions and vaccines for 
the prevention and/or treatment of HPV disease or HPV-associated cancer. 

30 The present invention relates to synthetic DNA molecules encoding the HPV31 LI 

protein. In one aspect of the invention, the nucleotide sequence of the synthetic molecule is altered to 
eliminate transcription termination signals that are recognized by yeast. In another aspect, the codons of 
the synthetic molecules are designed so as to use the codons preferred by a yeast cell. The synthetic 
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molecules may be used as a source of HPV3 1 LI protein, which may self-assemble into VLPs. Said 
VLPs may be used in a VLP-based vaccine. 

A particular embodiment of the present invention comprises a synthetic nucleic acid 
molecule which encodes the HPV31 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule 
5 comprising a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 

As stated above, provided herein are synthetic polynucleotides encoding the HPV3 1 LI 
gene which are free from transcription termination signals that are recognized by yeast. This invention 
also provides synthetic polynucleotides encoding HPV 3 1 LI as described, which are further altered so as 
to contain codons that are preferred by yeast cells. 
1 0 Also provided are recombinant vectors and recombinant host cells, both prokaryotic and 

eukaryotic, which contain the nucleic acid molecules disclosed throughout this specification. 

The present invention relates to a process for expressing an HPV3 1 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV31 
LI protein into a yeast host cell; wherein the nucleic acid molecule is free from internal transcription 
15 termination signals that are recognized by yeast and; (b) culturing the yeast host cell under conditions 
which allow expression of said HPV31 LI protein. 

The present invention further relates to a process for expressing an HPV31 LI protein in 
a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an 
HPV31 LI protein into a yeast host cell; wherein the nucleic acid molecule is codon-optimized for 
20 optimal expression in the yeast host cell and; (b) culturing the yeast host cell under conditions which 
allow expression of said HPV31 LI protein. 

In preferred embodiments, the nucleic acid comprises a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also relates to HPV31 virus-like particles (VLPs), methods of producing 
25 HPV3 1 VLPs, and methods of using HPV3 1 VLPs. 

In a preferred embodiment of the invention, the HPV3 1 VLPs are produced in yeast. In a 
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces 
cerevisiae, Hansenula polymorphs Pichia pastoris, Kluyvermyces fragilis, Kluveromyces lactis, and 
Schizosaccharomyces pombe. 
30 Another aspect of this invention is an HPV3 1 VLP, which comprises an HPV3 1 LI 

protein produced by a HPV31 LI gene which is free from transcription termination signals that are 
recognized by yeast. 

Yet another aspect of this invention is an HPV31 VLP, which comprises an HPV31 LI 
protein produced by a codon-optimized HPV31 LI gene. In a preferred embodiment of this aspect of the 
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invention, the codon-optimized HPV31 LI gene consists essentially of a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also provides a method for inducing an immune response in an animal 
comprising administering HPV31 virus-like particles to the animal. In a preferred embodiment, the 
5 HP V3 1 VLPs are produced by a codon-optimized gene. In a further preferred embodiment, the HPV3 1 
VLPs are produced by a gene that is free from transcription termination sequences that are recognized by 
yeast. 

Yet another aspect of this invention is a method of preventing or treating HPV-associated 
cervical cancer comprising administering to a mammal a vaccine comprising HPV31 VLPs. In a 
10 preferred embodiment of this aspect of the invention, the HPV3 1 VLPs are produced in yeast. 

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further 
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one 
additional HPV type is selected from the group consisting of: HPV6, HPV1 1, HPV16, HPV18, HPV33, 
15 HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

This invention also relates to pharmaceutical compositions comprising HPV 31 virus-like 
particles. Further, this invention relates to pharmaceutical compositions comprising HP V3 1 VLPs and 
VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV 
type is selected from the group consisting of: HPV6, HPV1 1, HPV 16, HPV1 8, HPV33, HPV35, HPV39, 
20 HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

As used throughout the specification and in the appended claims, the singular forms "a," 
"an," and "the" include the plural reference unless the context clearly dictates otherwise. 

As used throughout the specification and appended claims, the following definitions and 
25 abbreviations apply: 

The term "promoter" refers to a recognition site on a DNA strand to which the RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive 
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or 
"upstream activating sequences" or inhibiting sequences termed "silencers". 
30 The term "vector" refers to some means by which DNA fragments can be introduced into 

a host organism or host tissue. There are various types of vectors including plasmid, virus (including 
adenovirus), bacteriophages and cosmids. 

The designation "31 LI wild-type sequence" refers to the HPV31 LI sequence disclosed 
herein as SEQ ID NO:l. Although the HPV 3 1 LI wild-type sequence has been described previously, it 
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is not uncommon to find minor sequence variations between DNAs obtained from clinical isolates. 
Therefore, a representative HPV31 LI wild-type sequence was isolated from clinical samples previously 
shown to contain HPV 31 DNA (see EXAMPLE 1). The 31 LI wild-type sequence was used as a 
reference sequence to compare the codon-optimized HPV 31 LI sequences disclosed herein (see FIGURE 
5 1). 

The designation "3 1 LI partial rebuild" refers to a construct, disclosed herein (SEQ ID 
NO:2), in which the HPV3 1 LI nucleotide sequence was partially rebuilt to contain yeast-preferred 
codons for optimal expression in yeast. The 31 LI partial rebuild comprises alterations in the middle 
portion of the HPV 31 LI wild-type nucleotide sequence (nucleotides 697-1249). The complete HPV 31 

10 LI sequence was also rebuilt with yeast-preferred codons, which is referred to herein as the "3 1 LI total 
rebuild" (SEQ ID NO:3). 

The term "effective amount" means sufficient vaccine composition is introduced to 
produce the adequate levels of the polypeptide, so that an immune response results. One skilled in the art 
recognizes that this level may vary. 

15 A "conservative amino acid substitution" refers to the replacement of one amino acid 

residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions 
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine; 
glutamic acid for aspartic acid). 

20 The term "mammalian" refers to any mammal, including a human being. 

"VLP" or "VLPs" mean(s) virus-like particle or virus-like particles. 
"Synthetic" means that the HPV3 1 LI gene has been modified so that it contains a 
sequence of nucleotides that is not the same as the sequence of nucleotides present in the naturally 
occurring wild-type HPV3 1 LI gene. As stated above, synthetic molecules are provided herein 

25 comprising a sequence of nucleotides that are altered to eliminate transcription termination signals 

recognized by yeast. Also provided herein are synthetic molecules comprising codons that are preferred 
for expression by yeast cells. The synthetic molecules provided herein encode the same amino acid 
sequences as the wild-type HPV3 1 LI gene. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a sequence alignment showing nucleotides that were altered in the partial 
(SEQ ID NO:2) and total rebuild (SEQ ID NO:3) 31 LI genes (See EXAMPLE 2). The reference 
sequence is the 31 LI wild-type sequence (SEQ ID NO:l; see EXAMPLE 1). Nucleotides in the 31 LI 
partial and total rebuild sequences that are identical to the reference sequence are indicated with dots. 
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Altered nucleotides are indicated at their corresponding location. Nucleotide number is contained within 
the parentheses. 

FIGURE 2 shows the 31 LI total rebuild nucleotide (SEQ ID NO:3) and amino acid 
sequences (SEQ ID NO:4). The nucleotide number is indicated on the left. 
5 FIGURE 3 summarizes the changes between the three HPV 3 1 LI sequence constructs, 

which are listed on the left. The fourth column indicates the percent nucleotide identity between the 
indicated construct and the 31 LI wild-type sequence and the fifth column indicates the amino acid 
identity. The last column indicates the number of nucleotides that were altered to yeast-preferred codon 
sequences and the region where the alterations were made. 

10 FIGURE 4 shows a Northern blot probed specifically for HPV 3 1 LI under high 

stringency (see EXAMPLE 4). Arrows on the left indicate the position of the HPV 31 LI flxll length and 
truncated transcripts. Lanes labeled "3 1 wt" are from the same RNA preparation of yeast containing 3 1 
LI wild-type sequences. The lane labeled "16" contains RNA from HPV16, which is not recognized by 
the HPV 3 1 LI probe because of the high stringency conditions. The lane labeled "Neg" is a yeast extract 

15 containing no LI coding sequences. Lanes labeled "31R" are from RNA of two separate isolated 
colonies expressing the 3 1 LI partial rebuild sequence. 

FIGURE 5 shows a portion of the data from two capture radioimmunoassay (RIA) 
experiments in counts per minute (cpm)/mg total protein (see EXAMPLE 7). Cpm obtained in the RIA 
are a relative indicator of HPV 31 LI VLPs. The RIA data demonstrate increased 3 1 LI VLP expression 

20 in yeast protein extracts from yeast-preferred codon rebuilt gene sequences. 

FIGURE 6 shows a representative sample of the 31 LI VLPs described herein, as 
visualized by transmission electron microscopy (see EXAMPLE 8). The bar represents 100 nm. 

DETAILED DESCRIPTION OF THE INVENTION 

25 The majority of cervical carcinomas are associated with infections of specific oncogenic 

types of human papillomavirus (HPV). The present invention relates to compositions and methods to 
elicit or enhance immunity to the protein products expressed by genes of oncogenic HPV types. 
Specifically, the present invention provides polynucleotides encoding HPV31 LI and HPV31 virus-like 
particles (VLPs) and discloses use of said polynucleotides and VLPs in immunogenic compositions and 

30 vaccines for the prevention and/or treatment of HPV-associated cancer. 

The wild-type HPV3 1 LI nucleotide sequence has been reported (Goldsborough et al., 
Virology 171(1): 306-31 1 (1989); Genbank Accession # J04353). The present invention provides 
synthetic DNA molecules encoding the HPV31 LI protein. The synthetic molecules of the present 
invention comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to 
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eliminate transcription termination signals that are recognized by yeast. In alternative embodiments, the 
codons of the synthetic molecules are designed so as to use the codons preferred by a yeast cell for high- 
level expression. The synthetic molecules may be used as a source of HPV3 1 LI protein, which may 
self-assemble into VLPs. Said VLPs may be used in a VLP-based vaccine to provide effective 
5 immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell-mediated 
immunity. Such VLP-based vaccines are also useful for treatment of already established HPV infections. 

Expression of HPV VLPs in yeast cells offers the advantages of being cost-effective and 
easily adapted to large-scale growth in fermenters. However, many HPV LI proteins, including HPV31 
LI (see EXAMPLE 4), are expressed at low levels in yeast cells. It has been determined in accordance 

10 with the present invention that low level expression of HPV3 1 LI is due to truncation of the mRNA 
transcript resulting from the presence of transcription termination signals that are recognized by yeast. 
By altering the HPV31 LI DNA to eliminate any potential sequences resembling yeast transcription 
termination sites, it is possible to facilitate the transcription of full-length mRNA resulting in increased 
HPV31 LI protein expression. 

15 Accordingly, in some embodiments of this invention, alterations have been made to the 

HPV31 LI DNA to eliminate any potential sequences resembling yeast transcription termination signals. 
These alterations allow expression of the full-length HPV31 transcript, as opposed to a truncated 
transcript (see EXAMPLE 4), improving expression yield. 

As noted above, synthetic DNAs of the present invention comprise alterations from the 

20 wild-type HPV3 1 LI sequence that were made to eliminate yeast-recognized transcription termination 
sites. One of skill in the art will recognize that additional DNA molecules can be constructed that encode 
the HPV31 LI protein, but do not contain yeast transcription termination sites. Techniques for finding 
yeast transcription termination sequences are well known in the art. Transcription termination and 3' end 
formation of yeast mRNAs requires the presence of three signals: (1) an efficiency element such as 

25 TATATA or related sequences, which enhances the efficiency of positioning elements located 

downstream; (2) positioning element(s), which determine the location of the poly(A) site and (3) the 
polyadenylation site (usually Py(A)n). 

The scientific literature is replete with descriptions of sequences that encode yeast 
transcription termination signals. See, for example, Guo and Sherman, Trends Biochem. Sci. 21: 477-481 

30 (1986); Guo and Sherman, Mol Cell Biol 16(6): 2772-2776 (1996); Zaret et al, Cell 28:563-573 (1982); 
Henikoff et al, Cell 33:607-614 (1983); Thalenfeld et al, J. Biol Chem. 258(23):14065-14068 (1983); 
Zaret etal,/.M>/. Biol 176:107-135 (1984); Heidmann et al, Mol CellBiol 14:4633-4642 (1984); and 
Russo, Yeast 11:447-453 (1985). Therefore, one of skill in the art would have no difficulty determining 
which sequences to avoid in order to construct a synthetic HPV3 1 LI gene that produces a full-length 
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mRNA transcript in accordance with the present invention. Additionally, assays and procedures to assess 
whether a yeast transcription termination sequence is present within the synthetic sequence are well 
established in the art, so that an ordinary skilled artisan would be able to determine if a constructed 
HPV31 LI sequence comprises termination sequences that need to be eliminated. 
5 As described above, the present invention relates to a nucleic acid molecule encoding 

HPV type 3 1 LI protein, the nucleic acid molecule being free from internal transcription termination 
signals which are recognized by yeast. In exemplary embodiments of the invention, the synthetic nucleic 
acid molecules comprise a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 

In alternative embodiments of the present invention, HPV31 LI gene sequences are 

10 "optimized" for high level expression in a yeast cellular environment. Codon-optimized HPV31 LI genes 
contemplated by the present invention include synthetic molecules encoding HPV31 LI that are free from 
internal transcription termination signals which are recognized by yeast, further comprising at least one 
codon that is codon-optimized for high level expression in yeast cells. 

A "triplet" codon of four possible nucleotide bases can exist in over 60 variant forms. 

15 Because these codons provide the message for only 20 different amino acids (as well as transcription 
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon 
known as codon redundancy. For reasons not completely understood, alternative codons are not 
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a 
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example, 

20 the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA, 
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed 
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA 
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this 
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a leucine- 

25 rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For 

example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a CTG 
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a 
leucine-rich polypeptide in yeast host cells would be TTA. 

The implications of codon preference phenomena on recombinant DNA techniques are 

30 manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels 
of exogenous genes in successfully transformed host organisms-a less "preferred" codon may be 
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as 
efficiently. This phenomenon suggests that synthetic genes which have been designed to include a 
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of 
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recombinant DNA techniques. Thus, one aspect of this invention is an HPV31 LI gene that is codon- 
optimized for expression in a yeast cell. In a preferred embodiment of this invention, it has been found 
that the use of alternative codons encoding the same protein sequence may remove the constraints on 
expression of HPV31 LI proteins by yeast cells. 
5 In accordance with this invention, HPV31 LI gene segments were converted to 

sequences having identical translated sequences but with alternative codon usage as described by Sharp 
and Cowe (Synonymous Codon Usage in Saccharomyces cerevisiae. Yeast 7: 657-678 (1991)), which is 
hereby incorporated by reference. The methodology generally consists of identifying codons in the wild- 
type sequence that are not commonly associated with highly expressed yeast genes and replacing them 

10 with optimal codons for high expression in yeast cells. The new gene sequence is then inspected for 
undesired sequences generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent 
creation of intron splice recognition sites, unwanted restriction enzyme sites, etc.). Undesirable 
sequences are eliminated by substitution of the existing codons with different codons coding for the same 
amino acid. The synthetic gene segments are then tested for improved expression. 

15 The methods described above were used to create synthetic gene segments for HPV3 1 

LI, resulting in a gene comprising codons optimized for high level expression. While the above 
procedure provides a summary of our methodology for designing codon-optimized genes for use in HPV 
vaccines, it is understood by one skilled in the art that similar vaccine efficacy or increased expression of 
genes may be achieved by minor variations in the procedure or by minor variations in the sequence. 

20 Accordingly, the present invention relates to a synthetic polynucleotide comprising a 

sequence of nucleotides encoding an HPV31 LI protein, or a biologically active fragment or mutant form 
of an HPV31 LI protein, the polynucleotide sequence comprising codons optimized for expression in a 
yeast host. Said mutant forms of the HPV31 LI protein include, but are not limited to: conservative 
amino acid substitutions, amino-terminal truncations, carboxy-terminal truncations, deletions, or 

25 additions. Any such biologically active fragment and/or mutant will encode either a protein or protein 
fragment which at least substantially mimics the immunological properties of the HPV31 LI protein as 
set forth in SEQ ID NO:4. The synthetic polynucleotides of the present invention encode mRNA 
molecules that express a functional HPV3 1 LI protein so as to be useful in the development of a 
therapeutic or prophylactic HPV vaccine. 

30 One aspect of this invention is a codon-optimized nucleic acid molecule which encodes 

the HPV31 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a sequence of 
nucleotides as set forth in SEQ ID NO:2. 
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Another aspect of this invention is a codon-optimized nucleic acid molecule which 
encodes the HPV3 1 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a 
sequence of nucleotides as set forth in SEQ ID NO:3. 

The present invention also relates to recombinant vectors and recombinant host cells, 
5 both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this 
specification. 

The synthetic HPV3 1 DNA or fragments thereof constructed through the methods 
described herein may be recombinantly expressed by molecular cloning into an expression vector 
containing a suitable promoter and other appropriate transcription regulatory elements, and transferred 

10 into prokaryotic or eukaryotic host cells to produce recombinant HPV3 ILL Techniques for such 

manipulations are described in the art (Sambrook et al. Molecular Cloning: A Laboratory Manual; Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, (1989); Current Protocols in Molecular 
Biology, Ausubel et al., Green Pub. Associates and Wiley-Interscience, New York (1988); Yeast 
Genetics: A Laboratory Course Manual, Rose et al., Cold Spring Harbor Laboratory, Cold Spring Harbor, 

1 5 New York, (1 990), which are hereby incorporated by reference in their entirety). 

Thus, the present invention further relates to a process for expressing an HPV31 LI 
protein in a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid 
encoding an HPV3 1 LI protein into a yeast host cell; wherein the nucleic acid molecule is codon- 
optimized for optimal expression in the yeast host cell and; (b) culturing the yeast host cell under 

20 conditions which allow expression of said HPV31 LI protein. 

The present invention also relates to a process for expressing an HPV3 1 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV3 1 
LI protein into a yeast host cell; wherein the nucleic acid molecule is free from internal transcription 
termination signals which are recognized by yeast and; (b) culturing the yeast host cell under conditions 

25 which allow expression of said HPV31 LI protein. 

This invention further relates to a process for expressing an HPV3 1 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid as set forth in SEQ 
ID NO:2 or SEQ ID NO:3 into a yeast host cell; and, (b) culturing the host cell under conditions which 
allow expression of said HPV31 LI protein. 

30 The synthetic genes of the present invention can be assembled into an expression cassette 

that comprises sequences designed to provide efficient expression of the HPV58 LI protein in the host 
cell. The cassette preferably contains the synthetic gene, with related transcriptional and translations 
control sequences operatively linked to it, such as a promoter, and termination sequences. In a preferred 
embodiment, the promoter is the S. cerevisiae GAL1 promoter, although those skilled in the art will 
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recognize that any of a number of other known yeast promoters such as the GAL10, GAL7, ADH1, TDH3 
or PGK promoters, or other eukaryotic gene promoters may be used. A preferred transcriptional 
terminator is the S. cerevisiae ADH1 terminator, although other known transcriptional terminators may 
also be used. The combination oiGALl promoter -ADH1 terminator is particularly preferred. 
5 Another aspect of this invention is an HPV3 1 virus-like particle (VLP), methods of 

producing HPV3 1 VLPs, and methods of using HPV3 1 VLPs. VLPs can self-assemble when LI , the 
major capsid protein of human and animal papillomaviruses, is expressed in yeast, insect cells, 
mammalian cells or bacteria (for review, see Schiller and Roden, in Papillomavirus Reviews: Current 
Research on Papillomaviruses; Lacey, ed. Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). 

10 Morphologically indistinct HPV VLPs can also be produced by expressing a combination of the LI and 
L2 capsid proteins. VLPs are composed of 72 pentamers of LI in a T=7 icosahedral structure (Baker et 
d.,Biophys. J. 60(6): 1445-56 (1991)). 

VLPs are morphologically similar to authentic virions and are capable of inducing high 
titres of neutralizing antibodies upon administration into an animal. Immunization of rabbits (Breitburd et 

15 aL, J. Virol. 69(6): 3959-63 (1995)) and dogs (Suzich et aL, Proc. Natl Acad. ScL USA 92(25): 1 1553-57 
(1995)) with VLPs was shown to both induce neutralizing antibodies and protect against experimental 
papillomavirus infection. However, because the VLPs do not contain the potentially oncogenic viral 
genome and can self-assemble from a single gene, they present a safe alternative to use of live virus in 
HPV vaccine development (for review, see Schiller and Hidesheim, J. Clin. Virol. 19: 67-74 (2000)). 

20 Thus, the present invention relates to virus-like particles comprised of recombinant L 1 

protein or recombinant LI + L2 proteins of HPV31. 

In a preferred embodiment of the invention, the HPV3 1 VLPs are produced in yeast. In a 
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces 
cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyvermyces fragilis, Kluveromyces lactis, and 

25 Schizosaccharomyces pombe. 

Another aspect of this invention is an HPV31 VLP, which comprises an HPV31 LI 
protein produced by a HPV31 LI gene that is free from internal transcription termination signals that are 
recognized by yeast. 

Yet another aspect of this invention is an HPV3 1 VLP which comprises an HPV31 LI 
30 protein produced by a codon-optimized HPV31 LI gene. In a preferred embodiment of this aspect of the 
invention, the codon-optimized HPV31 LI gene consists essentially of a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

Yet another aspect of this invention is a method of producing HPV3 1 VLPs, comprising: 
(a) transforming yeast with a recombinant DNA molecule encoding HPV3 1 LI protein or HPV31 LI + 
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L2 proteins; (b) cultivating the transformed yeast under conditions that permit expression of the 
recombinant DNA molecule to produce the recombinant HPV31 protein; and (c) isolating the 
recombinant HPV31 protein to produce HPV31 VLPs. 

In a preferred embodiment of this aspect of the invention, the yeast is transformed with a 
5 HPV3 1 LI gene that is free from transcription termination signals that are recognized by yeast. In 
' another preferred embodiment, the yeast is transformed with a codon-optimized HPV3 1 LI gene to 
produce HPV31 VLPs. In a particularly preferred embodiment, the codon-optimized HPV31 LI gene 
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also provides a method for inducing an immune response in an animal 
10 comprising administering HPV3 1 virus-like particles to the animal. In a preferred embodiment, the 

HPV3 1 VLPs are produced by a gene that is free from internal transcription termination sequences that 
are recognized by yeast. In a further preferred embodiment, the HPV31 VLPs are produced by a codon- 
optimized gene. 

Yet another aspect of this invention is a method of preventing or treating HPV-associated 
1 5 cervical cancer comprising administering to a mammal a vaccine comprising HPV3 1 VLPs. In a 
preferred embodiment of this aspect of the invention, the HPV3 1 VLPs are produced in yeast. 

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further 
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one 
20 additional HPV type is selected from the group consisting of: HPV6, HPV1 1, HPV 16, HPV18, HPV33, 
HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

In a preferred embodiment of this aspect of the invention, the vaccine further comprises 

HPV16 VLPs. 

In another preferred embodiment of the invention, the vaccine further comprises HPV 16 
25 VLPs and HPV18 VLPs. 

In yet another preferred embodiment of the invention, the vaccine further comprises 
HPV6 VLPs, HPV11 VLPs, HPV16 VLPs and HPV18 VLPs. 

This invention also relates to pharmaceutical compositions comprising HPV 31 virus-like 
particles. Further, this invention relates to pharmaceutical compositions comprising HPV3 1 VLPs and 
30 VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV 
type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, HPV35, HPV39, 
HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

Vaccine compositions of the present invention may be used alone at appropriate dosages 
defined by routine testing in order to obtain optimal inhibition of HPV31 infection while minimizing any 
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potential toxicity. In addition, co-administration or sequential administration of other agents may be 
desirable. 

The amount of virus-like particles to be introduced into a vaccine recipient will depend 
on the immunogenicity of the expressed gene product. In general, an immunologically or 
5 prophylactically effective dose of about 10 ^g to 100 |ig, and preferably about 20 \ig to 60 \ig of VLPs is 
administered directly into muscle tissue. Subcutaneous injection, intradermal introduction, impression 
though the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation 
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided. 
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of 
10 administration with adjuvants such as alum or Merck alum adjuvant, concurrently with or subsequent to 
parenteral introduction of the vaccine of this invention is also advantageous. 

All publications mentioned herein are incorporated by reference for the purpose of 
describing and disclosing methodologies and materials that might be used in connection with the present 
15 invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate 
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled in the art 
20 without departing from the scope or spirit of the invention as defined in the appended claims. 

The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Determination of a representative HPV 3 1 LI sequence 

25 The HPV 31 LI wild-type sequence has been described previously (Goldsborough et al., 

Virology 171(1): 306-3 1 1 (1989); Genbank Accession # J04353). It is not uncommon, however, to find 
minor sequence variations between DNAs obtained from clinical isolates. To isolate a representative 
HPV31 LI wild-type sequence, DNA was isolated from three clinical samples previously shown to 
contain HPV 31 DNA. HPV 31 LI sequences were amplified in a polymerase chain reaction (PCR) using 

30 Taq DNA polymerase and the following primers: HPV 31 LI F 5' - CGT CGA CGT AAA CGT GTA 
TCA TAT TTT TTT ACA G - 3' (SEQ ID NO:5) and HPV 31 LIB 5' - CAG ACA CAT GTA TTA 
CAT ACA CAA C - 3' (SEQ ID NO: 6). The amplified products were electrophoresed on agarose gels 
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and visualized by ethidium bromide staining. The ~ 1500 bp LI bands were excised and DNA purified 
using the QIA quick PCR purification kit (Qiagen, Hilden, Germany). The DNA was then ligated to the 
TA cloning vector, pCR-II (Invitrogen Corp., Carlsbad, CA), E. coli transformed, and plated on LB agar 
with ampicillin plus IPTG and X-gal for blue/white colony selection. The plates were inverted and 
5 incubated for 16 hours at 37°C. White colonies were cultured in LB medium with ampicillin, shaking at 
37°C for 16 hours, and minipreps were performed to extract the plasmid DNA. 

To demonstrate the presence of the LI gene in the plasmid, restriction endonuclease 
digestions were conducted and viewed by agarose gel electrophoresis and ethidium bromide staining. 
DNA sequencing was performed on plasmids containing cloned LI from each of the three clinical 

10 isolates. DNA and translated amino acid sequences were compared with one another and the Genbank 
HPV 31 LI sequences. Sequence analysis of the three clinical isolates revealed that no sequence was 
identical to the Genbank sequence. The pCR-H-HPV 31L1/81 clone was chosen to be the representative 
31L1 sequence and is referred to herein as the "31 LI wild-type sequence" (SEQ ID NO:l, see FIGURE 
1). The sequence chosen as 31 LI wild-type contained one silent substitution at nucleotide 1266 and a 

15 change from a C to a G at nucleotide 1295, altering the encoded amino acid from threonine to serine. The 
3 1 LI partial and total rebuilt genes (SEQ ID NOs: 2 and 3, respectively) also encode a serine at this 
location (see FIGURE 1). In all cases, the amino acid sequences are identical. Nucleotides were changed 
in the rebuilt constructs to encode amino acids using yeast-preferred codon sequences and to eliminate 
potential transcription termination signals (see EXAMPLE 2). 

20 The 3 1 LI wild-type sequence was amplified using the LS-101 5 ' - CTC AGA TCT CAC 

AAA ACA AAA TGT CTC TGT GGC GGC CTA GC - 3 ' (SEQ ID NO:7) and LS-102 5' - GAC AGA 
TCT TAC TTT TTA GTT TTT TTA CGT TTT GCT GG - 3' (SEQ ID NO:8) primers to add BgKI 
extensions. PCR was performed using Vent™ DNA polymerase. The PCR product was visualized by 
ethidium bromide staining of an agarose gel. The ~ 1500 bp band was excised and DNA purified using 

25 the QIAEX II gel extraction kit (Qiagen). The PCR product was then digested with BgFLl at 37 °C for 2 
hours and purified using the QIA quick PCR purification kit The BgKl digested 3 1 LI PCR product was 
ligated to BamKl digested pGALl 10 and DH5 E. coli were transformed. Colonies were screened by PCR 
for the HPV 31 LI insert in the correct orientation. Sequence and orientation were confirmed by DNA 
sequencing. The selected clone was named pGALl 10-HPV 31L1 #2. 

30 Maxiprep DNA was then prepared and Saccharomyces cerevisiae were made competent 

and transformed. The yeast transformation was plated in Leu sorbitol top-agar on Leu" sorbitol plates 
and incubated inverted for 3-5 days at 30°C. Colonies were picked and streaked for isolation on Leu' 
sorbitol plates. To induce LI transcription and protein expression, isolated colonies were subsequently 
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grown in 5 ml of 5 X Leu" Ade" sorbitol with 1.6% glucose and 4% galactose in rotating tube cultures at 
30°C. 

EXAMPLE 2 

5 Yeast codon optimization 

Yeast-preferred codons have been described (Sharp and Cowe, Yeast 7: 657-678 (1991)). 
Initially, the middle portion of HPV 31 LI, representing nucleotides 697-1249, was rebuilt utilizing yeast- 
preferred codons. The strategy employed to rebuild was to design long overlapping sense and antisense 
oligomers that span the region to be rebuilt, substituting nucleotides with yeast-preferred codon sequences 

10 while maintaining the same amino acid sequence. These oligomers were used in place of template DNA 
in the PCR reaction. Additional amplification primers were designed and used to amplify the rebuilt 
sequences from template oligomers with Pfu DNA polymerase (Stratagene, La Jolla, CA). The optimal 
conditions for amplification were section-specific; however, most employed a program resembling the 
following: an initial denaturation step of 94°C for 1 minute, followed by 15-25 cycles of 95°C for 30 sec 

15 denature, 55°C for 30 sec anneal, 72°C for 3.5 minutes extension, followed by a 72°C for 10 minute final 
extension and 4°C hold. 

PCR products were examined by agarose gel electrophoresis. Bands of the appropriate 
size were excised and the DNA was gel purified. The amplified fragments were then used as template to 
assemble the 552 nucleotide rebuilt HPV 3 1 middle LI fragment. PCR was then used to amplify the 

20 wild-type nucleotides 1-725 (5'end) and 1221-1515 (3'end). A final PCR using the 5'end, the 3 ? end, and 
the rebuilt middle was performed to generate full-length 31 LI partial rebuild, referred to herein as the 
"31 LI partial rebuild". 

The complete 3 1 LI sequence was also rebuilt with yeast-preferred codons. This 
construct is referred to herein as the "31 LI total rebuild". Nine long overlapping oligomers were used to 

25 generate yeast-preferred codon nucleotide sequences from 1-753 and four long overlapping oligomers 
were used to generate yeast-preferred codon nucleotide sequences from 1207-1515. After amplification 
and gel purification, these fragments, along with the middle rebuilt section described above (nucleotides 
697-1249), were used together in a PCR reaction to generate the full length 31 LI total rebuild sequence. 
This piece was generated with BamHI extensions. The gel purified rebuilt 3 1L1 DNA was digested with 

30 BamHI, ligated to BaviHL digested pGALl 10 expression vector and transformed into E. coli DH5 cells. 
Colonies were screened by PCR for the HPV 31 LI insert in the correct orientation. Sequence and 
orientation were confirmed by DNA sequencing. 
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Plasmid DNA was prepared. S. cerevisiae cells were made competent and transformed. 
The yeast were plated in Leu sorbitol top-agar on Leu sorbitol plates and incubated inverted for 3-5 
days. Colonies were streaked for isolation on Leu- sorbitol plates. Isolated colonies were subsequently 
5 grown in 5 ml of 5 X Leu- Ade- sorbitol with 1 .6% glucose and 4% galactose in rotating tube cultures at 
30°C to induce LI transcription and protein expression. After 48-72 hours, culture volume equivalent to 
an OD600 = 10 was pelleted, supernate removed and the pellets frozen and stored -70°C. 

EXAMPLE 3 

10 RNA preparation 

Cell pellets of transformed yeast, which were induced to express HPV 31 LI by galactose 
induction, were thawed on ice and suspended in 1 ml of cold DEPC-treated water. Cells were pelleted by 
centrifugation and the resulting supernatant was removed. The cell pellet was then resuspended in 400 ^1 
TES (10 mM Tris pH7.0, 10 mM EDTA and 0.5% SDS). An equal volume of AE buffer-saturated 

15 phenol (50 mM NaOAc and 10 mM EDTA) was added. The tube was vortexed for 10 seconds and 
heated to 65°C for 50 minutes with mixing every 10 minutes. The tube was then placed on ice for 5 
minutes, followed by centrifugation at 4°C for 5 minutes. The supernatant was collected and transferred 
to a sterile tube. An additional 400 \i\ of phenol was added, the tube vortexed, placed on ice for 5 
minutes and centrifiiged. The supernatant was transferred to a sterile tube and 400 |il of chloroform 

20 added, mixed and centrifiiged. The supernatant was again collected and transferred to a sterile tube and 
40 \il 3 M Na Acetate pH 5.2 added in addition to 1 ml 100% EtOH. The tube was placed on dry ice for 
one hour, after which it was centrifiiged at high speed to pellet the RNA. The RNA was washed one time 
with 70% EtOH and air-dried. The RNA was then suspended in 100 jal DEPC-treated water and heated to 
65°C for 5 minutes to dissolve. Spectrophotometry was performed to determine the concentration of 

25 RNA in the sample using the assumption that an A260 reading of 1 = 40 |iig/ml RNA when the A260/280 
is 1.7-2.0. 
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EXAMPLE 4 

Northern blot analysis 

Initial analysis of yeast expressing 31 LI wild-type suggested that the expression yield of 
HPV 31 LI protein was considerably less than was expected. To determine if the low expression was 
5 occurring due to a problem at the transcription level versus the translation level, Northern blot analysis of 
the HPV 3 1 LI transcript was performed. Northern blots were made from gels in which RNA from yeast 
expressing HPV16 LI was run with RNA from yeast expressing HPV31 LI on the same gel to compare 
transcript sizes. 

A 1.2% agarose formaldehyde gel was cast. Ten micrograms of RNA was combined 

10 with denaturing buffer (final concentrations: 6% formaldehyde, 45% formamide and 0.9 x MOPS) and 
heated to 55°C for 15 minutes. A one-tenth volume of gel loading buffer was added and the sample 
loaded onto the gel. Electrophoresis was performed at 65 volts in 1 x MOPS buffer for ~ 5 hours. The 
gel was washed for 15 minutes in sterile water followed by two five minute washes in 10 x SSC. The 
RNA was transferred to a Hybond-N+ nylon membrane (Amersham Biosciences, Piscataway, NJ) by 

15 capillary action over 16 hours in 10 x SSC. The RNA was then fixed to the nylon membrane by cross- 
linking using the Amersham cross-linker set for 700 units of energy. After fixing, the nylon membrane 
was allowed to air dry. The membrane was placed in 30 ml Zetaprobe buffer at 55°C for 2 hours after 
which 32P-labeled probes were added and incubated for 16 hours at 53-65°C. The membrane was then 
washed 3 times in 5 X SSC at room temperature for 20 minutes, followed by 2 times in 0.4 x SSC for 20 

20 minutes at room temperature and once at 60°C for 10 minutes. Probe DNA was generated by PCR using 
HPV 31 LI sequence specific sense and antisense primers. The amplified DNA was labeled by treatment 
with polynucleotide kinase (PNK) and y- 32P ATP at 37°C for 1 hour. The blot was wrapped in saran 
wrap and exposed to x-ray film for 16 hours. Upon film development, probe-hybridized RNA was 
detected as a black band on the autoradiograph. 

25 Analysis of the Northern blot described above revealed that the majority of the full- 

length HPV 31 LI wild-type transcripts were considerably smaller than full length (see FIGURE 4). 
However, the 3 1 LI partial rebuild was designed not only to insert yeast-preferred codons in the middle 
of the gene, but also to eliminate any potential sequences resembling yeast transcription termination sites. 
Northern blot analysis clearly showed that upon rebuilding, the length of the 31 LI gene transcript had 

30 significantly increased to a size corresponding with that of the full-length HPV 16 LI transcript (not 

shown). Thus, premature transcription termination is likely to have accounted for a significant portion of 
the low expression yield from the 31 LI wild-type construct. 
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EXAMPLE 5 

HPV 31 LI protein expression 

Frozen yeast cell pellets of galactose induced cultures equivalent to OD600= 10 were 
5 thawed on ice and suspended in 300 pi of PC buffer (100 mM Na2HP04 and 0.5 M NaCl, pH 7.0) with 
2mM PMSF. Acid-washed 0.5mm glass beads were added, ~ 0.5g/tube. The tubes were vortexed for 15 
minutes at 4°C. 7.5 |il of 20% TritonXlOO was added and vortex repeated for 5 minutes at 4°C. The 
tubes were placed on ice for 15 minutes, then centrifixged for 15 minutes at 4°C. The supernate was 
transferred to a sterile microcentrifuge tube and stored at -70°C 

10 

EXAMPLE 6 

Western blot analysis 

Total yeast protein extract from twenty to forty isolated yeast colonies for each HPV 31 
LI construct were analyzed by Western blot to confirm expression of HPV 3 1 LI protein after galactose 
15 induction. 

Ten micrograms of total yeast protein extract was combined with SDS-PAGE loading 
buffer and heated to 95°C for 10 minutes. The proteins were loaded onto an 8% SDS-PAGE gel and 
electrophoresed in Tris-Glycine buffer. After protein separation, the proteins were Western transferred 
from the gel to nitrocellulose and the blot was blocked in 10% non-fat dry milk in TTBS (Tris buffered 

20 saline with Tween -20) for 16 hours. The blot was washed three times in TTBS. Goat anti-trpE-HPV 16 
L 1 serum, a polyclonal serum that cross-reacts with HPV 3 1 L 1 , was applied at a 1 : 1 000 dilution in 
TTBS for 1 hr at room temperature. The blot was washed three times in TTBS and anti-goat-HRP 
conjugated antibody was applied at a 1:2500 dilution in TTBS for 1 hr. The blot was again washed three 
times and ECL™ detection reagent was applied (Amersham Biosciences, Piscataway, NJ). 

25 Autoradiography was then performed. Proteins recognized by the antiserum were visualized by the 
detection reagent as dark bands on the autoradiograph. 

In all cases, the HPV 31 LI protein was detected as a distinct band on the autoradiograph 
corresponding to approximately 55 kD (data not shown). The HPV 16 LI protein was included as a 
positive control on the gels. 

30 
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EXAMPLE 7 

Radioimmunoassay CRIA) 

The yeast cells expressing HPV 31 LI were grown by a variety of methods, including 
rotating tube cultures, shake flasks and fermenters. The yeast were lysed and protein extracts made to 
5 determine the amount of HPV 3 1 LI virus-like particles (VLPs) produced per milligram of total protein. 
To demonstrate HPV 31 LI VLP expression, a portion of each total yeast protein extract was analyzed by 
capture radioimmunoassay (RIA). 

The RIA was performed using a detection monoclonal antibody, H31.A6, that is HPV 
type 31-specific and VLP conformational-specific. H31.A6 is specific for HPV type 31 LI as it is found 

10 to bind intact HPV 31 LI VLPs and does not recognize denatured HPV 31 VLPs. This mAb can be 

subsequently detected by a goat anti-mouse antibody radiolabeled with 1125. Therefore, the counts per 
minute (cpm) values correspond to relative levels of HPV31 LI VLP expression. 

Polystyrene beads were coated with a goat anti-trpE-HPV31 LI polyclonal serum diluted 
1 : 1000 in PBS overnight. The beads were then washed with 5 volumes of sterile distilled water and air- 

15 dried. The antigen, total yeast protein extract from isolated yeast colonies, was then loaded onto the 

beads by dilution in PBS with 1% BSA, 0.1% Tween-20 and 0.1% Na Azide and incubated with rotation 
for one hour. After washing, the beads were distributed one per well in a 20-well polystyrene plate and 
incubated with H3 1 .A6 mAb diluted 1 :50,000 for 17-24 hours at room temperature. The beads were 
washed and.I125 labeled goat anti-mouse IgG was added at an activity range of 23000-27000 cpm per 10 

20 jil. After 2 hours, the beads were washed and radioactive counts were recorded in cpm/ml. Background 
counts from blank wells were subtracted from the total cpm/ml, giving the RIA minus background value. 

Two experiments were performed: in experiment 1, protein extracts from 31 LI wild-type 
and 3 1 LI partial rebuild were compared and in experiment 2, protein extracts from 31 LI partial rebuild 
and 3 1 LI total rebuild were compared (see FIGURE 5). Results indicate that 3 1 LI partial rebuild VLP 

25 expression is 6.9 fold greater than 31 LI wild-type. The 31 LI total rebuild has a 1.7 fold increased 

expression over the 3 1 LI partial rebuild. Therefore, the 3 1 LI expression levels were increased > 7 fold 
by introducing yeast-preferred codon sequences and eliminating potential transcription termination 
signals. 
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EXAMPLE 8 

Transmission electron microscopy 

To demonstrate that the HPV 31 LI protein was in fact self-assembling to form 
pentameric-Ll capsomers, which in turn self-assemble into virus-like particles, a partially purified 31 LI 
5 total rebuild protein extract was subject to transmission electron microscopy (TEM). Yeast were grown 
under small scale fermentation and pelleted. The pellets were subjected to purification treatments. Pellet 
and clarified yeast extracts were analyzed by immunoblot to demonstrate LI protein expression and 
retention through the purification procedure. Clarified yeast extracts were then subjected to 
centrifugation over a 45%-sucrose cushion and the resulting pellet suspended in buffer for TEM analysis 
1 0 (see FIGURE 6). Results indicated that the diameter of the spherical particles in this crude sample ranged 
from between 30 and 60 nm with some particles displaying a regular array of capsomers. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a sequence of nucleotides that encodes an 
HPV3 1 LI protein as set forth in SEQ ID NO:4, the nucleic acid sequence being codon-optimized for 

5 high level expression in a yeast cell. 

2. A vector comprising the nucleic acid molecule of claim 1 . 

3. A host cell comprising the vector of claim 3. 

10 

4. The host cell of claim 3, wherein the host cell is selected from the group 
consisting of: Saccharomyces cerevisiae, Hansenula pofymorpha, Pichia pastoris, Kluyvermyces fragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 

15 5 . The host cell of claim 4, wherein the host cell is Saccharomyces cerevisiae. 

6. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides 
comprises a sequence of nucleotides as set forth in SEQ ID NO:2. 

20 7. A vector comprising the nucleic acid molecule of claim 6. 

8. A host cell comprising the vector of claim 7. 

9. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides 
25 comprises a sequence of nucleotides as set forth in SEQ ID NO:3. 

10. A vector comprising the nucleic acid molecule of claim 9. 

11. A host cell comprising the vector of claim 10. 



30 



12. Virus-like particles (VLPs) comprised of recombinant LI protein or recombinant 
LI + L2 proteins of HPV31. 
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13. The VLPs of Claim 12 wherein the recombinant LI protein or the recombinant 
LI + L2 proteins are produced in yeast. 

14. The VLPs of claim 13, wherein the recombinant LI protein or recombinant LI + 
5 L2 proteins are encoded by a codon-optimized HPV3 1 LI nucleic acid molecule. 

15. The VLPs of claim 14, wherein the codon-optimized nucleic acid molecule 
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 



10 16. A method of producing the VLPs of Claim 14, comprising: 

(a) transforming yeast with a codon-optimized DNA molecule 
encoding HPV31 LI protein or HPV31 LI + L2 proteins; 

(b) cultivating the transformed yeast under conditions that permit 
expression of the codon-optimized DNA molecule to produce a 

1 5 recombinant papillomavirus protein; and 

(c) isolating the recombinant papillomavirus protein to produce the 
VLPs of Claim 14. 



20 



17. A vaccine comprising the VLPs of Claim 14. 

18. Pharmaceutical compositions comprising the VLPs of claim 14. 



19. A method of preventing HPV infection comprising administering the vaccine of 
Claim 17 to a mammal. 

25 

20. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 14 to an animal. 



21 . The virus-like particles of Claim 14 wherein the yeast is selected from the group 
30 consisting of Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyvermyces fragilis, 

Kluyveromyces lactis, and Schizosaccharomyces pombe. 

22. The virus-like particles of claim 21, wherein the yeast is Saccharomyces cerevisiae. 



-22- 



WO 2004/084831 



PCT/US2004/008677 



23 . The vaccine of claim 1 7, further comprising VLPs of at least one additional HP V 

type. 

24. The vaccine of claim 23, wherein the at least one additional HPV type is selected 
5 from the group consisting of: HPV6, HPV11, HPV 16, HPV18, HPV33, HPV35, HPV39, HPV45, 

HPV51 , HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

25. The vaccine of claim 24, wherein the at least one HPV type comprises HPV16. 
10 26. The vaccine of claim 25, further comprising HPV1 8 VLPs. 

27. The vaccine of claim 26, further comprising HPV6 VLPs and HPV1 1 VLPs. 

28. A nucleic acid molecule comprising a sequence of nucleotides that encodes an 
1 5 HPV3 1 LI protein, the nucleic acid molecule free from transcription termination signals that are 

recognized by yeast. 

29. A vector comprising the nucleic acid molecule of claim 28. 

20 30. A host cell comprising the vector of claim 29. 

3 1 . The host cell of claim 30, wherein the host cell is selected from the group 
consisting of: Saccharomyces cerevisiae, Hansenula polymorphs Pichia pastoris, Kluyvermycesfragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 



25 



32. The host cell of claim 31, wherein the host cell is Saccharomyces cerevisiae. 



33. The VLPs of claim 13, wherein the recombinant LI protein or recombinant LI + 
L2 proteins are encoded by a HPV31 LI nucleic acid molecule that is free from transcription termination 

30 signals that are recognized by yeast. 

34. A method of producing the VLPs of Claim 33, comprising: 
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(a) transforming yeast with a DNA molecule encoding HPV3 1 LI 
protein or HPV31 LI + L2 proteins, the DNA molecule free from 
transcription termination sequences that are recognized by yeast; 

(b) cultivating the transformed yeast under conditions that permit 
5 expression of the DNA molecule to produce a recombinant 

papillomavirus protein; and 

(c) isolating the recombinant papillomavirus protein to produce the 
VLPs of Claim 33. 

10 3 5 . A vaccine comprising the VLPs of Claim 33 . 

36. Pharmaceutical compositions comprising the VLPs of claim 33. 

37. A method of preventing HPV infection comprising administering the vaccine of 
1 5 Claim 35 to a mammal. 

38. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 33 to the animal. 

20 39. The vaccine of claim 35, further comprising VLPs of at least one additional HPV 

type. 

40. The vaccine of claim 39, wherein the at least one additional HPV type is selected 
from the group consisting of: HPV6, HPV1 1, HPV 16, HPV1 8, HPV33, HPV35, HPV39, HPV45, 

25 HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

41 . The vaccine of claim 40, wherein the at least one HPV type comprises HPV1 6. 

42. The vaccine of claim 41, further comprising HPV18 VLPs. 

30 

43. The vaccine of claim 42, further comprising HPV6 VLPs and HPV1 1 VLPs. 
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HPV 31 LI nucleotide sequence alignment. 



31 LI wt ( 1) ATGTCTCTGTGGC6GCCTA6CGAG6CTACTGTCTACTTACCACCTGTCCC 

31 LI partial ( 1) 

31 LI total ( 1) T A.A..ATCT..A C G A 

31 LI wt ( 51) AGTGTCTAAAGTTGTAAGCACGGATGAATATGTAACACGAACCAACATAT 

31 LI partial ( 51) 

31 LI total ( 51) ...C G..C..CTCT..C..C C..C..CA C. 

31 LI wt ( 101) ATTATCACGCAGGCAGTGCTAGGCTGCTTACAGTAGGCCATCCATATTAT 

31 LI partial ( 101) 

31 LI total ( 101) .C..C T..TTC AT. .T.G. .C. .C. .T. .C C..C 

31 LI wt ( 151) TCCATACCTAAATCTGACAATCCTAAAAAAATAGTTGTACCAAAGGTGTC 

31 LI partial ( 151) 

31 LI total ( 151) . .T. .C. .A. .G C. .A. .G. .G. .C. .C. .C C. . 

31 LI wt ( 201) AGGA1TACMTATAGGGTATTTAGGGTTCGTTTACCAGATCCAAACAAAT 

31 LI partial ( 201) 

31 LI total ( 201) T. .T. .G C. .A. .C. X. .A. XA.A. .G C G. 

31 LI wt ( 251) nGGATTTCCTGATACATCTTTTTATAATCCTGAAACTCAACGCTTAGTT 

31 LI partial ( 251) 

31 LI total ( 251) .C. .T. .C. .A. .C. X C. X. X. .A C. . .A. A. .G. X 

31 LI wt ( 301) TGGGCCTGTGTTGGTTTAGAGGTAGGTCGCGGGCAGCCATTAGGTGTAGG 

31 LI partial ( 301) 

31 LI total ( 301) T C G. .A. X. . .A. A. .T. .A G C. . 

31 LI wt ( 351) TATTAGTGGTCATCCATTATTAAATAAATTTGATGACACTGAAAACTCTA 

31 LI partial ( 351) 

31 LI total (351) . . XTC C G. .G. X. .G. X. X C 

31 LI wt ( 401) ATAGATATGCCGGTGGTCCTGGCACTGATAATAGGGAATGTATATCAATG 

31 LI partial ( 401) 

31 LI total ( 401) X C. .T A. .T. X. X. X. .A C. ,T. . . 
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31 LI wt ( 451) GATTATAAACAAACACAACTGTGTTTACTT6GTTGCAAACCACCTATT6G 

31 LI partial ( 451) 

31 LI total ( 451) . .C. .C. .G C. . .T GT.G T. .G A. .C. . 

31 LI wt ( 501) AGAGCATTGGGGTAAAGGTAGTCCTTGTAGTAACAATGCTATTACCCCTG 

31 LI partial ( 501) 

31 LI total ( 501) T. .A. .C G. . .TC. . .A. . .TC C C A. 

31 LI wt ( 551) GTGATTGTCCTCCATTAGAATTAAAAAATTCAGTTATACAAGATGGGGAT 

31 LI partial ( 551) 

31 LI total ( 551) ....C A G G. .G. .C. .T. .C. .C C..T.X 

31 LI wt ( 601) ATGGTTGATACAGGCTTTGGAGCTATGGATTTTACTGCTTTACAAGACAC 

31 LI partial ( 601) 

31 LI total ( 601) C. .C. X. .T. X. .T C. X. .C G 

31 LI wt ( 651) TAAMGTMTGTTCCTTTGGACATTTGTM1TCTATTTGTAAATATCCAG 

31 LI partial ( 651) 

31 LI total ( 651) C. .GTC. . X. X. .A C C C G.X.... 

31 LI wt ( 701) ATTATCTTAAAATGGTTGCTGAGCCATATGGCGATACATTAI 1 1 1 1 1 I AT 

31 LI partial (701) C C.X..G.X.X.X 

31 LI total ( 701) X.XTX.X C.....A C C.X.X.X.X.X 

31 LI wt ( 751) TTACGTAGGGMCAMTGTTTGTMGGCATnTTTTAATAGATCAGGCAC 

31 LI partial ( 751) ..G A G C C.X.X.X C 

31 LI total ( 751) ..G A G C C.X.X.X C 

31 LI wt ( 801) GGTTGGTGAATCGGTCCCTACTGACTTATATATTAAAGGCTCCGGTTCAA 

31 LI partial ( 801) C. .A T A. X. . X.G. X. X. .G C. 

31 LI total ( 801) C. .A T A. X. . X.G. X. X. .G C. 

31 LI wt ( 851) CAGCTACTTTAGCTAACAGTACATACTTTCCTACACCTAGCGGCTCCATG 

31 LI partial (851) X CC.G TCC.X C. .A. .T. .ATCT 

31 LI total ( 851) X CC.G TCC.X C. .A. .T. .ATCT 

31 LI wt ( 901) GTTACTTCAGATGCACAAATTTTTAATAAACCATATTGGATGCAACGTGC 

31 LI partial (901) .X.X.X.X..T..G. .C.X.X.X C G 

31 LI total ( 901) . X. X. X. X. .T. .G. X. X. X. X C G 
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31 LI wt ( 951) TCAGGGACACMTMTGGTATnGTTGGGGCMTCAGTTATTTGTTACTG 

31 LI partial (951) A T C..C C T..C...C.G. .C. .G 

31 LI total ( 951) A T C. .C C T. .C. . .C.G. .C. .G. . . . 

31 LI wt (1001) TGGTAGATACCACACGTAGTACCAATATGTCTGTTTGTGCTGCAATTGCA 

31 LI partial (1001) ...X G...TC C C C..T 

31 LI total (1001) ....C G...TC C C C..T 

31 LI wt (1051) MCAGTGATACTACATTTAAMGTAGTMTTTTAAAGAGTATTTAAGACA 

31 LI partial (1051) ...TC...C C. .C. .GTCCTC. . .C. .C. .G CC.G 

31 LI total (1051) . . .TC. . .C C. .C. .GTCCTC. . .C. .C. .G CC.G 

31 LI wt (1101) TGGTGAGGMTTTGATTTACMT1TATATTTCAGTTATGCAAAATAACAT 

31 LI partial (1101) C...C.G C..C..C G G..C..CC 

31 LI total (1101) C...C.G C..C..C G G..C..CC 

31 LI wt (1151) TATCTGCAGACATAATGACATATATTCACAGTATGAATCCTGCTATTTTG 

31 LI partial (1151) .G T C C..C..C C C..CC. 

31 LI total (1151) .G T C C. .C. X; C C. .CC. . 

31 LI wt (1201) GMGAnGGMTTTTGGAnGACCACACCTCCCTCAGGTTCTTTGGAGGA 

31 LI partial (1201) . .G. .C C. .C. .TC T. .A. .T. X 

31 LI total (1201) ..G..C.....C..C..TC T..A..T.X A.. 

31 LI wt (1251) TACCTATAGGTTTGTAACCTCACAGGCCATTACATGTCAAAAAAGTGCCC 

31 LI partial (1251) 

31 LI total (1251) C C. .A. X. X T. .A. .T. X. X GTC. . .T. 

31 LI wt (1301) CCCAAMGCCCMGGMGATCCATnAMGAnATGTATTTTGGGAGGTT 

31 LI partial (1301) 

31 LI total (1301) .A A C C. .G. X. X. X. X A. X 

31 LI wt (1351) MTTTAAMGAAMGTTTTCTGCAGATTTAGATCAGTTTCCACTGGGTCG 

31 LI partial (1351) 

31 LI total (1351) . X. .G. X C T. X. X. X. .A. X. . .T A. 

31 LI wt (1401) CAMTTTTTATTACAGGCAGGATATAGGGCACGTCCTAAATTTAAAGCAG 

31 LI partial (1401) 

31 LI total (1401) A. .G. X. X. .G. .A. .T. .T. X. .A. .TA.A. .A. X. X. X. .T. 
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31 LI wt (1451) 

31 LI partial (1451) 

31 LI total (1451) 

31 LI wt (1501) 

31 LI partial (1501) 

31 LI total (1501) 
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GTAAACGTAGT6CACCCTCAGCATCTACCACTACACCAGCAAAACGTMA 

. . . .GA.ATC. . .T. .A. .T. .T C. .C T. .GA.A. .G 

AAAACTAAAAAGTAA (SEQ ID N0:1) 

(SEQ ID N0:2) 

(SEQ ID N0:3) 
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HPV31 LI total rebuild nucleotide and amino acid sequences. 

MSLW RPS EAT VYLP PVP 
1 ATGTCTTTGT GGAGACCATC TGAAGCTACC GTCTACTTGC CACCAGTCCC 

VSK VVST DEY VTR TNIY 
51 AGTCTCTAAG GTCGTCTCTA CCGACGAATA CGTCACCAGA ACCAACATCT 

YHA G S A RLLT VGH PYY 
101 ACTACCACGC TGGTTCTGCT AGATTGTTGA CCGTCGGTCA CCCATACTAC 

S IPK SDN PKK IVVP KVS 
151 TCTATCCCAA AGTCTGACAA CCCAAAGAAG ATCGTCGTCC CAAAGGTCTC 

GLQ YRVF RVR LPD PNKF 
201 TGGTTTGCAA TACAGAGTCT TCAGAGTCAG ATTGCCAGAC CCAAACAAGT 

GFP DTS FYNP ETQ RLV 
251 TCGGTTTCCC AGACACCTCT TTCTACAACC CAGAAACCCA AAGATTGGTC 

WACV GLE VGR GQPL GVG 
301 TGGGCTTGTG TCGGTTTGGA AGTCGGTAGA GGTCAACCAT TGGGTGTCGG 

ISG HPLL NKF DDT ENSN 
351 TATCTCTGGT CACCCATTGT TGAACAAGTT CGACGACACC GAAAACTCTA 

RYA GGP GTDN REC ISM 
401 ACAGATACGC TGGTGGTCCA GGTACCGACA ACAGAGAATG TATCTCTATG 

DYKQ TQL CLL GCKP PIG 
451 GACTACAAGC AAACCCAATT GTGTTTGTTG GGTTGTAAGC CACCAATCGG 

EHW GKGS PCS NNA ITPG 
501 TGAACACTGG GGTAAGGGTT CTCCATGTTC TAACAACGCT ATCACCCCAG 

DCP PLELKNSVIQDGD 
551 GTGACTGTCC ACCATTGGAA TTGAAGAACT CTGTCATCCA AGACGGTGAC 
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MVDT GFG AMD F T A L QDT 
601 AT6GTCGACA CCGGTTTCGG TGCTATGGAC TTCACCGCTT TGCAAGACAC 

KSN VPLD ICN SIC KYPD 
651 CAAGTCTAAC GTCCCATTGG ACATCTGTAA CTCTATCTGT AAGTACCCAG 

YLK M V A EPYG DTL FFY 
701 ACTACTTGAA GATGGTCGCT GAACCATACG GCGACACCTT GTTCTTCTAC 

LRRE QMF VRH FFNR SGT 
751 TTGCGTAGAG AACAGATGTT CGTAAGGCAC TTCTTCAACA GATCCGGCAC 

VGE SVPT DLY IKG SGST 
801 CGTAGGTGAA TCTGTCCCAA CCGACCTGTA CATCAAGGGC TCCGGTTCCA 

ATL ANS TYFP TPS GSM 
851 CCGCTACCCT GGCTAACTCC ACCTACTTCC CAACTCCATC TGGCTCCATG 

VTSD A Q I FNK PYWM QRA 
901 GTCACCTCCG ACGCTCAGAT CTTCAACAAG CCATACTGGA TGCAGCGTGC 

QGH NNGI CWG NQL FVTV 
951 ACAGGGTCAC AACAACGGTA TCTGTTGGGG TAACCAGCTG TTCGTGACTG 

VDT TRS TNMS VCA A I A 
1001 TGGTCGATAC CACGCGTTCT ACCAACATGT CTGTCTGTGC TGCAATCGCT 

NSDT TFK SSN FKEY LRH 
1051 AACTCTGACA CTACCTTCAA GTCCTCTAAC TTCAAGGAGT ACCTGAGACA 

GEE FDLQ FIF QLC KITL 
1101 TGGTGAGGAA TTCGATCTGC AATTCATCTT CCAGTTGTGC AAGATCACCC 

SAD IMT YIHS MNP AIL 
1151 TGTCTGCTGA CATCATGACC TACATCCACA GTATGAACCC TGCCATCCTG 

EDWN FGL TTP PSGS LED 
1201 GAGGACTGGA ACTTCGGTCT GACCACTCCA CCTTCCGGTT CTTTGGAAGA 
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TYR FVTS Q A I TCQ KSAP 
1251 CACCTACA6A TTCGTCACCT CTCAA6CTAT CACCTGTCAA AAGTCTGCTC 

QKP KED PFKD YVF WEV 
1301 CACAAAAGCC AAAGGAAGAC CCATTCAAGG ACTACGTCTT CTGGGAAGTC 

NLKE KF.S ADL DQFP LGR 
1351 AACTTGAAGG AAAAGTTCTC TGCTGACTTG GACCAATTCC CATTGGGTAG 

KFL LQAG YRA RPK FKAG 
1401 AAAGTTCTTG TTGCAAGCTG GTTACAGAGC TAGACCAAAG TTCAAGGCTG 

KRS APS ASTT TPA KRK 
1451 GTAAGAGATC TGCTCCATCT GCTTCTACCA CCACCCCAGC TAAGAGAAAG 

K T K K * (SEQ ID N0:4) 
1501 AAGACCAAGA AGTAA (SEQ ID NO: 3) 
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Northern Blot Analysis 
31 wt 31 wt 16 Neg 31 R 31 R 



Full Length 
Truncated 
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SEQUENCE LISTING 

<110> Merck & Co., Inc. 

Jansen, Kathrin U. 
Schultz, Loren D. 
Neeper, Michael P. 
Markus, Henry Z. 

<120> OPTIMIZED EXPRESSION OF HPV 31 LI IN 
YEAST 

<130> 21188-PCT 

<150> 60/457,172 
<151> 2003-03-24 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1515 
<212> DNA 

<213> HPV31 LI wild-type 
<400> 1 

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60 
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120 
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180 
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240 
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 3 00 
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360 
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420 
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480 
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540 
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600 
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660 
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720 
gagccatatg gcgatacatt atttttttat ttacgtaggg aacaaatgtt tgtaaggcat 780 
ttttttaata gatcaggcac ggttggtgaa tcggtcccta ctgacttata tattaaaggc 840 
tccggttcaa cagctacttt agctaacagt acatactttc ctacacctag cggctccatg 900 
gttacttcag atgcacaaat ttttaataaa ccatattgga tgcaacgtgc tcagggacac 960 
aataatggta tttgttgggg caatcagtta tttgttactg tggtagatac cacacgtagt 1020 
accaatatgt ctgtttgtgc tgcaattgca aacagtgata ctac&tttaa aagtagtaat 1080 
tttaaagagt atttaagaca tggtgaggaa tttgatttac aatttatatt tcagttatgc 1140 
aaaataacat tatctgcaga cataatgaca tatattcaca gtatgaatcc tgctattttg 1200 
gaagattgga attttggatt gaccacacct ccctcaggtt ctttggagga tacctatagg 1260 
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320 
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380 
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440 
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500 
aaaactaaaa agtaa 1515 

<210> 2 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 partial rebuild 
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<400> 2 

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60 
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120 
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180 
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240 
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 300 
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360 
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420 
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480 
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540 
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600 
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660 
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720 
gagccatacg gcgacacctt gttcttctat ttgcgtagag aacagatgtt cgtaaggcac 780 
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840 
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900 
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960 
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020 
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080 
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140 
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200 
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggagga tacctatagg 1260 
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320 
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380 
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440 
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500 
aaaactaaaa agtaa 1515 

<210> 3 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 total rebuild 
<400> 3 

atgtctttgt ggagaccatc tgaagctacc gtctacttgc caccagtccc agtctctaag 60 
gtcgtctcta ccgacgaata cgtcaccaga accaacatct actaccacgc tggttctgct 120 
agattgttga ccgtcggtca cccatactac tctatcccaa agtctgacaa cccaaagaag 180 
atcgtcgtcc caaaggtctc tggtttgcaa tacagagtct tcagagtcag attgccagac 240 
ccaaacaagt tcggtttccc agacacctct ttctacaacc cagaaaccca aagattggtc 300 
tgggcttgtg tcggtttgga agtcggtaga ggtcaaccat tgggtgtcgg tatctctggt 360 
cacccattgt tgaacaagtt cgacgacacc gaaaactcta acagatacgc tggtggtcca 420 
ggtaccgaca acagagaatg tatctctatg gactacaagc aaacccaatt gtgtttgttg 480 
ggttgtaagc caccaatcgg tgaacactgg ggtaagggtt ctccatgttc taacaacgct 540 
atcaccccag gtgactgtcc accattggaa ttgaagaact ctgtcatcca agacggtgac 600 
atggtcgaca ccggtttcgg tgctatggac ttcaccgctt tgcaagacac caagtctaac 660 
gtcccattgg acatctgtaa ctctatctgt aagtacccag actacttgaa gatggtcgct 720 
gaaccatacg gcgacacctt gttcttctac ttgcgtagag aacagatgtt cgtaaggcac 780 
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840 
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900 
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960 
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020 
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080 
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140 
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200 
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggaaga cacctacaga 1260 
ttcgtcacct ctcaagctat cacctgtcaa aagtctgctc cacaaaagcc aaaggaagac 132 0 
ccattcaagg actacgtctt ctgggaagtc aacttgaagg aaaagttctc tgctgacttg 1380 
gaccaattcc cattgggtag aaagttcttg ttgcaagctg gttacagagc tagaccaaag 1440 
ttcaaggctg gtaagagatc tgctccatct gcttctacca ccaccccagc taagagaaag 1500 
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aagaccaaga agtaa 1515 
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<210> 5 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 5 

cgtcgacgta aacgtgtatc atattttttt acag 34 

<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 6 

cagacacatg tattacatac acaac 25 

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 7 

ctcagatctc acaaaacaaa atgtctctgt ggeggectag c 41 

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<400> 8 

gacagatctt actttttagt ttttttacgt tttgctgg 
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